Bilingual and dialectal adaptation and retraining
نویسندگان
چکیده
In this paper, we report our investigations on the use of adaptation and retraining in our bilingual (Italian, German) and multidialectal recognition system. Our approach for bilingual speech recognition is to assume the two languages as being one, which is best suited for a task where Italian and German natives speak both languages, resulting in a variety of accents and dialects. We performed adaptation on single speakers and speaker groups built from combinations of spoken and native language. Furthermore, we performed retraining on partitions of the adaptation or training data. Our experiments led to an error rate reduction in all cases: compared to the baseline system, we achieved an overall improvement of 14, 12–14 and 7 % for speaker adaptation, speaker group adaptation and retraining, respectively. Furthermore, we found among others that performance is rather stable for Italian between adaptation and retraining, while adaptation for German outperforms retraining by far.
منابع مشابه
Using a small development set to build a robust dialectal Chinese speech recognizer
To make full use of a small development data set to build a robust dialectal Chinese speech recognizer from a standard Chinese speech recognizer (based on Chinese Initial/Final, IF), a novel, simple but effective acoustic modeling method, named state-dependent phoneme-based model merging (SDPBMM), is proposed and evaluated, where a shared-state of standard tri-IF is merged with a state of diale...
متن کاملLearning Better Monolingual Models with Unannotated Bilingual Text
This work shows how to improve state-of-the-art monolingual natural language processing models using unannotated bilingual text. We build a multiview learning objective that enforces agreement between monolingual and bilingual models. In our method the first, monolingual view consists of supervised predictors learned separately for each language. The second, bilingual view consists of log-linea...
متن کاملDomain and Dialect Adaptation for Machine Translation into Egyptian Arabic
In this paper, we present a statistical machine translation system for English to Dialectal Arabic (DA), using Modern Standard Arabic (MSA) as a pivot. We create a core system to translate from English to MSA using a large bilingual parallel corpus. Then, we design two separate pathways for translation from MSA into DA: a two-step domain and dialect adaptation system and a one-step simultaneous...
متن کاملDeveloping a based-on play cognitive-behavioral educational package and determining its effectiveness on improving the language disorders and social adjustment of bilingual children
Background: The present study was conducted with the aim of developing a game-based cognitive-behavioral educational package and determining its effectiveness on improving the receptive language disorders and social adjustment of bilingual children. Methods: The current study was applied in terms of purpose and in terms of the nature of the data, it was semi-experimental with a pre-test-post...
متن کاملA New Framework for Domain Adaptation without Model Retraining
We propose a principled and effective domain adaptation framework that pursues the goal of Open Domain NLP (train once, test anywhere). Most domain adaptation frameworks adapt the models trained on the source domain data by retraining it on target domains (with a mix of labeled and unlabeled data). However, it is time consuming to retrain big models or pipeline systems, and may not even be feas...
متن کامل